Glottal closure and opening detection for flexible parametric voice coding
نویسنده
چکیده
The knowledge of glottal closure and opening instants (GCI/GOI) is useful for many speech analysis applications. A Pitchsynchronous waveform encoding of voice is one such application. In this paper, a dynamic programming is employed to solve for the global close/open phase segmentation based on the polynomial parametric waveform of the derivative glottal waveform and its quasi-periodicity. Not only does the algorithm identify GCIs, but also the elusive GOIs, and as a by-product, the parameters of the glottal excitation waveform. The results show its effectiveness compared with a classical method. Its application to parametric voice encoding which allows for simple time-pitch scaling as well as voicing quality conversion is demonstrated.
منابع مشابه
Robust Structured Voice Extraction for Flexible Expressive Resynthesis a Dissertation Submitted to the Department of Electrical Engineering and the Committee on Graduate Studies of Stanford University in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy
Parametric representation of audio allows for a reduction in the amount of data needed to represent the sound. If chosen carefully, these parameters can capture the expressiveness of the sound, while reflecting the production mechanism of the sound source, and thus allow for an intuitive control in order to modify the original sound in a desirable way. In order to achieve the desired parametric...
متن کاملVoice quality dimensions of pitch accents
Acoustic and electroglottographic (EGG) measurements were used to examine voice quality parameters during the production of the rising and falling pitch movements in German. The vowels / / and / / were studied in a single-speaker speech corpus. The acoustic measurements comprised an automatic spectral analysis of the glottal parameters open quotient (OQ), glottal opening (GO), skewness of glott...
متن کاملVoice quality dimensions o
Acoustic and electroglottographic (EGG) measurements were used to examine voice quality parameters during the production of the rising and falling pitch movements in German. The vowels / / and / / were studied in a single-speaker speech corpus. The acoustic measurements comprised an automatic spectral analysis of the glottal parameters open quotient (OQ), glottal opening (GO), skewness of glott...
متن کاملRegulation of glottal closure and airflow in a three-dimensional phonation model: implications for vocal intensity control.
Maintaining a small glottal opening across a large range of voice conditions is critical to normal voice production. This study investigated the effectiveness of vocal fold approximation and stiffening in regulating glottal opening and airflow during phonation, using a three-dimensional numerical model of phonation. The results showed that with increasing subglottal pressure the vocal folds wer...
متن کاملAdvances in Glottal Analysis and its Applications
From artificial voices in GPS to automatic systems of dictation, from voice-based identity verification to voice pathology detection, speech processing applications are nowadays omnipresent in our daily life. By offering solutions to companies seeking for efficiency enhancement with simultaneous cost saving, the market of speech technology is forecast to be particularly promising in the next ye...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006